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PROOF OF POINCARE'S GEOMETRIC THEOREM 



GEORGE D. BIRKHOFF* 

In a paper recently published in the Rendiconti del Circolo 
Matematico di Palermo (vol. 33, 1912, pp. 375-407) and en- 
titled Sur un theoreme de GSometrie, Poincare enunciated a theorem of great 
importance, in particular for the restricted problem of three bodies; but, 
having only succeeded in treating a variety of special cases after long-con- 
tinued efforts, he gave out the theorem for the consideration of other mathe- 
maticians. 

For some years I have been considering questions of a character similar to 
that presented by the theorem and it now turns out that methods which I 
have been using are here applicable. In the present paper I give the brief 
proof which I have obtained, but do not take up other results to which I 
have been led.f 

1. Statement of the Theorem. Poincare's theorem may be stated in a 
simple form as follows: Let us suppose that a continuous one-to-one trans- 
formation T takes the ring R , formed by concentric circles C a and d of radii 
a and b respectively (a > b > 0), into itself in such a way as to advance the 
points of C a in a positive sense, and the points of Ch in the negative sense, and 
at the same time to preserve areas. Then there are at least two invariant 
points. 

In the proof of this theorem we shall use modified polar coordinates y = r 2 , 
x = 8 where r is the distance of the point ( x , y ) from the center of the circles, 
and 8 is the angle which a line from the center to ( x , y) makes with a fixed 
line through the center. The transformation T may be written then 

x' = <p (x, y), y' = ^ (x, y). 

The function ij/ (x, y) is a continuous function of (x , y) , uniquely deter- 
mined at each point of R , and so is periodic in x of period 2-ir . The function 
<p(x,y) admits of an infinite number of determinations which differ from each 

* Presented to the Society, October 26, 1912. 

t Some of my results are contained in a paper entitled Quelques tMoremes sur les mouve- 
ments des systkmes dynamiques, which is shortly to appear in the Bulletin de la Societe 
Math6matique de France. 
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other by integral multiples of 2-ir , and these determinations can be grouped so 
as to form continuous branches. Since ( x + 2ir , y ) and ( x , y ) represent the 
same point of R, the algebraic difference between the values of one of these 
determinations taken f or ( x + 2ir , y ) and ( x , y ) reduces to an integral mul- 
tiple of 2w; and this difference must be one and the same multiple of 27r for 
all x and y because the difference is a continuous function. But if the point 
( x , y ) makes a positive circuit of the circle C a , the same is true of its image 
(x' , y'); hence along this path <p (x , y) increases by 27r when x increases 
by 2w . Thus the difference reduces identically to 2w, in other words, the 
function <p (x,y) increases by 2w when x is increased by 2ir . 

In consequence of these properties of <p ( x , y ) and ^ ( x , y ) , it is clear that 
x' — x and y' — y are both single-valued and continuous in R . 

The precise meaning of the theorem is that if any determination of <p ( x , y ) 
is made for which 

x' > x along C a and x' < x along Cb , 

(the conditions on T make possible such a choice) then we shall have at least 
two points ( x , y ) of R for which 

x' = x, y' = y. 

2. On the Method of Proof. As Poincare remarks (loc. cit., p. 377), the 
existence of one invariant point implies immediately the existence of a second 
invariant point. Hence if the theorem is false we may assume that there is no 
point invariant for T . In this case we shall have 

(i) (x'-xy+(y'- y y>cP>o 

for all points of R since x' — x and y' — y are single-valued, continuous, and 
not simultaneously zero, over the ring R. We shall establish the theorem 
by proving that the hypothesis (1) admits a reductio ad absurdum. 

3. The Auxiliary Transformation. If < e < b 2 , the one-to-one continuous 
transformation T e given by 

x' = x, y' = y — e 

takes the circles C a and Ct, into the concentric circles C' a : y = o 2 — e and 
C b :y — b 2 — e respectively, so that C' a is within C a at a distance a — V a 2 — e 
from C a , and C' b is similarly within Cb at a distance b — Vb 2 — e from Cb • 
This transformation effects a shrinking of the plane toward the origin which 
leaves every point on its radius vector and which preserves areas, since dx 
and dy are unaltered by the transformation and the integral of areas is 



I I rdrdd = \ j j dx dy . 
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Now, as long as e < d, the auxiliary transformation TT e , formed by fol- 
lowing T by T, , also has no invariant point. For if we take x and y as the 
rectangular coordinates of a point in the strip 8: 

— co < a: < -f- <x> , 6 2 <2/<o 2 , 

corresponding to the ring R , we see that the point ( x , y) is displaced at least 
d units in S by T and then is further displaced a distance e by T t in the direc- 
tion of the negative y-axis, so that if e < d , the point cannot come back to its 
initial position. We write this compound transformation TT e in the form 

x'=<p{x,y), y' = yj/{x, y) — e. 

Let us choose the positive quantity e once for all and so small that 

(2) «<6 2 , €<d, e<a 2 -b 2 . 
Consider now the multiple-valued function 

y' — y 

(3) a O, y) = arc tan ^, _ ^ , 

or, more accurately, those branches of this function which give the angle that 
the vector drawn from the point ( x , y ) to the point ( x' , y' ) in the strip S 
makes with the positive direction of the rc-axis (the other branches corre- 
sponding to the negative of this vector). In virtue of (1), this function is 
continuous at every point of S and accordingly falls into branches single- valued 
and continuous throughout S . 

Moreover any such branch reduces to a fixed even and odd multiple of 
■k along C a and C& respectively, since along C a we have x' > x, y' = y and 
along Cb , x' < x , y' = y . 

The functions y' — y and x' — x have been seen to be periodic in x of period 
2tt , and so any such branch of « ( x , y ) differs at ( x + 2tt , y ) and (x, y) 
by a multiple of 2w which may reduce to zero. Since the branches are con- 
tinuous, this multiple is one and the same throughout S . But along C a and 
Cb these branches have a constant value, as we have noted. Hence the mul- 
tiple will in fact reduce to zero. Thus these branches of w ( x , y ) are periodic 
in x of period 2tt , i. e., are single-valued in R. 

Likewise it is clear that the multiple-valued function 

(4) w(x, y) = arc tan- r _^, 

which gives the angle that the vector drawn from (x, y) to (x' , y') makes 
with the positive direction of the rc-axis, falls into single-valued and continous 
branches. Moreover the functions y' — y = y' — y — e and x'—x—x'—x are 
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periodic in x of period 2x; also each of the branches of u> (x, y) is periodic in 
x along C a and along C& . We conclude thatthe branches of w ( x , y ) as 
well as of a (x , y) are periodic in x of period 2x, i. e., are single- valued in R. 
The branches of w ( x , y ) and w ( a; , y ) may be associated in pairs so that 
the maximum numerical difference of any pair for every (x, y) is regulated 
in accordance with the formula 

(5) | u (x, y) - w (x, y) \ < ^. 

This is obvious geometrically, for the point ( x' , y' ) is at least d units distant 
from the point ( x , y ) in S , and ( x' , y' ) is distant e < d units from ( x' , y') 
so that the angle subtended at ( x , y) by these last-named points can never 
become equal to \tv . Thus if we associate the branches at a single point of S 
in accordance with (5), this inequality must continue to be true as the co- 
ordinates of the point vary continuously in S . 

4. Construction of a Curve Invariant for the Auxiliary Transformation. 
The transformation TT, of R takes C a to the circle C' a of radius V a 2 — e, 
between C a and C b , since we chose e < a 2 — b 2 . The repetition of TT e takes 
C' a into C'a , a simple closed curve. Now TT, is a one-to-one and continuous 
transformation of the ring R into the ring R' , limited by C' a , C' b and obtained 
by letting R shrink under the transformation T t . Since C' a lies within R and 
encloses Cb , its image C„ must lie within R' (and accordingly within C' a ) and 
enclose C' h . Furthermore the transformation TT t takes the ring formed by 
C a C'a in R into a second ring C' a C'a in R' , which abuts on the inner boundary C' a 
of the first ring . If C„' lies wholly in R it will enclose Cb as well as C' b , 
and the image C„" of C'a by TT, will be a simple curve within C a ' and enclosing 
C' b . Thus C a ' Ca' will form the boundary of a third ring abutting on the 
inner boundary of the second ring C' a C' a ' . This process may be continued 
to form simple curves C' a , C'„ , • • • lying one within the other , and correspond- 
ing rings C a C a , C a C'a , • • • , so long as the curves continue to lie wholly in 
R and not in part or wholly within Cb . 

The area of the ring C a C' a is -ire and will, since the transformations T and T t 
each preserve areas, be the same as that of its image rings C' a C'a , C'a C'a' , 
This series of rings can only terminate when a curve Ci n) is reached ( n ^ 2 ) 
part, at least, of which lies within the circle C b . But for every / , the area in- 
cluded between C a and CJ ) is wle , which if I is sufficiently large exceeds the 
area of R. It follows that such a curve C£° exists. Hence we can find a 
particular point P of C a whose ?ith image lies inside of the circle C b • 

Now let us turn to the strip S and to one of the representations of P on the 
upper side C a of the strip. Here C' a is represented by a straight line at distance 
e below C a , and C' a ,C' a ' , • • • are each represented by a simple open curve 

Trans. Am. Matb. Soc. 3 
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congruent by sections 2kw < a; < 2{h + 1) ir and extending indefinitely to 
right and left. The rings C a C' a , C' a C'a , ■ ■ • clearly are represented by the 
successive strata between successive curves of this set in S . Let P' be the 
first image of P and join PP' by a straight line which will lie entirely within 
the strip C a C' a . Let the arc P' P" be the image of PP' under TT, , and let 
P" P'" be the image of P' P" under the same transformation, and so on. In 
this way we construct successively PP' ,P'P", • • • , P< n-1 > P< n > lying respec- 
tively in the strata C a C' a ,C' a C'L , ■ ■ ■ , C^C? in S, the end point P (n > of 
the last of these arcs falling below the lower side C& of S . 

Let Q be the first intersection of this succession of arcs with the lower side 




C b of the strip (see figure). It is obvious that the curve PQ formed by this 
succession of arcs is a simple curve, for PP' , P' P" , • • • are successive simple 
arcs which lie in the successive strata C a C' a , C' a Ca , • • • in S . Furthermore 
PQ lies wholly between C a and Cb . 

The image P' Q' of PQ under the transformation TT, is made up of P' Q 
and an extended simple arc QQ' , the image of the arc Q _1 Q where Q -1 is 
the point that goes into Q by TT, . The arc QQ' being the image of points 
of R by TT, lies wholly below the straight line C' a ; the end-point Q' of this 
arc lies of course C' h . Furthermore QQ' has no point but Q in common with 
PQ . For if such a point exists it must lie on P' Q , and by performing the 
transformation inverse to TT, we see that Q~ X Q has a point other than Q" 1 
in common with PQ~ X , which is not possible since PQ has no multiple points. 
Thus PQ' forms a simple curve. 

The transformation TT, takes the arc PQ of PQ' into the arc P' Q' of PQ' , 
advancing each point of PQ along PQ'. In this sense the curve PQ is in- 
variant under the transformation. 

The properties of T and T, ensure that P' has an x greater than that of P , 
and that Q' has an x less than that of Q , as indicated in the figure. 

5. The Rotation of the Auxiliary Point-Image Vector on the Invariant 
Curve. If a point B moves along PQ' from P to Q , it is clear that its image B' 
by TT, moves from P' to Q' along the same curve, never coinciding with B. 
We shall now establish the fact (intuitively almost self-evident) that the 
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corresponding rotation of the vector BB' thus obtained is — t plus the sum 
of the two acute angles which the straight lines PP' and QQ' make with the 
ar-axis. 

At the outset it is obvious that the rotation can differ from this value only 
• by a multiple of 2w . That the rotation has precisely the value stated 
depends entirely on simple considerations of analysis situs. The fact on which 
this conclusion rests is that a continuous deformation of the curve PP' QQ' 
through a series of simple curves containing P , P' , Q , Q' brings the curve to 
the broken line position PP' QQ' (see figure). 

Let t be any monotonic parameter for the curve PQ taking on the increasing 
values t , t' , <i , t\ , at P , P' , Q , Q' respectively. Let t ( t ) be the value of 
this parameter for B' where t is its value for the corresponding point B . Clearly 
t (t) is a continuous increasing function of t (t ^ t < h) which has the 
property t ( t) > t . Consider now any varied simple curve through the same 
four points P , P' , Q , Q' on which a monotonic parameter t is so chosen that as 
before t , t'o,ti, t[ correspond to P , P' , Q, Q' respectively.* If the distance 
between any point of the varied curve and the point of PQ with the same 
parameter value is uniformly small, the corresponding vector BB' along the 
varied curve will undergo precisely the same total rotation as along PQ , 
since the initial and final positions are the same in either case and the 
angular differences of the vectors in all intermediate positions are uniformly 
small also. 

As a consequence of this reasoning it follows that we may deform the curve 
PP' QQ' continuously through any series of simple curves containing the same 
four points P , P' , Q , Q' provided that for the varied curves the parameter 
is properly chosen, and the total rotation of BB' will not thereby be altered. 

The curve PQ lies wholly in S , so that the straight line QQ' , being out- 
side of S , does not intersect the curve PQ . It is therefore apparent that 
the arc QQ' may be continuously deformed to this straight line position QQ' 
without taking the curve QQ' outside of the strip formed by C' a and C' b ; for the 
continuum formed by this strip is simply-connected after a cut in it by the 
curve P'Q is made, and hence any simple curve joining the points Q and Q' 
of this continuum, and lying in it, can be continuously deformed into any 
other such simple curve through a series of simple arcs not having any 
points but Q and Q' in common with the boundary of the cut strip. 

Next, the arc P' Q may be continously deformed, through a series of simple 
arcs joining P' to Q in the strip C' a C& in which P' Q wholly lies, into the 
straight line PQ , inasmuch as the straight lines QQ' and PP' have no points 
within this strip. 

* I include in the term " varied simple curve " any simple curves with the same geometric 
locus but with a different parameter. 
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Hence the rotation of BB' during its first series of positions is precisely the 
same as it is along the broken line formed by the three straight segments 
PP' , P' Q, QQ' . The rotation along the broken line from the initial to the 
final position is clearly — it plus the sum of the acute angles which the vectors 
PP' and QQ' make with the a>axis. Thus our statement is proved. 

In the first series of positions, B' is obtained from B by the transformation 
TT e . If the coordinates of B are ( x , y ) , those of B' are accordingly ( x' , y' ) , 
and the rotation of BB' is measured by the change in the function u> (x, y) as 
( x , y ) moves from P to Q along the invariant curve PQ . 

Let us fix upon that continuous branch &n ( x , y ) of this function which is 
measured at the point P by the negative of the acute angle which the vector 
PP' makes with the x-axis. At the point Q this determination will have the 
value — 7r plus the acute angle which the vector QQ' makes with the a>axis. 

6. The Rotation of the Point-Image Vector for T. Now fix upon that 
continuous branch coi ( x , y ) of the function co ( x , y ) which, along the upper 
side C of S , takes on the value zero. 

These two functions wi ( x , y ) and o>i(x,y) differ by less than fir at the point 
P and hence are branches of w (x,y) and co ( x , y ) associated by the inequality 
(5), which holds throughout S . Moreover the terminal value of the function 
<3i (x , y) has been shown to differ from — t by less than \-k at Q, so that 
coi ( x , y ) differs from — t by less than t at Q . Any branch of the function 
co ( x , y ) however has been seen to be precisely equal to a fixed odd multiple 
of 7T along Cb ■ Hence this function &>i ( x , y) must have the value — it at 
Q and also at all points of Cb . 

The variation of o>i ( x , y ) is therefore — ir when the point ( x , y ) moves in 
any manner whatever from a point of C a to a point of Cb. In other words 
if we let the point B move in any manner from C a to Cb in S , and let B' denote 
the image of B by the transformation T , the total rotation in the vector BB' will 
be precisely — ir . 

7. Completion of the Proof. Consider now the transformation T~ x inverse 
to T , which is in every respect similar to T except that points on C„ and Cb , 
and hence on the two sides of the strip S , are moved in the reverse direction. 
By symmetry, the vector B'B which joins B' to its image B under T~ l must 
now rotate through an angle + t as B and B' move from C a to Cb . 

Here the vector B'B is of sense opposite to that of BB'. But the actual 
rotation of the vectors BB' and B'B is of course one and the same quantity, so 
that a contradiction has been reached. Hence the theorem is completely 
proved. 

8. Generalizations of the Theorem. In phrasing his geometric theorem Poin- 
care makes the hypothesis that we have some integral invariant J J P(x,y)dxdy 
(P (x , y) > 0), not necessarily the invariant of areas. However, we can, 
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by a suitable change of coordinates from (x, y) to ( £ , 77 ) , change the 
integral invariant to the simpler area invariant. 

Let the lines 77 = const, be the circles concentric with C a and Cb, and 
let the number 77 (y) be so chosen for each circle that as a point moves from 
Cft to C a the double integral taken over the ring between Cb and the concentric 
circle through the moving point is equal to 77 : 



V(y)= f.{fo* P( - X ' V)dX \ 



dy. 



This function 77 (y) is clearly continuous and increasing and has a positive 
continuous derivative, namely 



dy (y) 



/»2ir 

= I P (x, y)dx. 



dy 

Speaking somewhat inexactly, the 77-curves are all so placed as to measure 
off equal increments of J j P ( x , y ) dx dy f or equal increments of y . Now 
(speaking in the same inexact sense) choose the ^-curves so as to measure off 
equal increments of the same double integral between these successive 77-curves 
and the line x = . 

The possibility of making such a choice of curves £ = const, may be seen 
as follows: Let x = / (y) be any curve lying in R, joining C a to Cb, and such 
that the part of the area between the initial line x = , this curve, the circle 
Cb and the circle y = const, is constantly proportional to y ( y ) : 

Xy f rf<y) I fc 

I Jo P( - X ' y ^ dX \ dy = 2r' n ^ yS> ' 

In view of the above mentioned properties of y ( y ) , this equation is equivalent 
to 



(6) P(x,y)dx = 



I 



■ /( " 1 k_ d v (y) 

!x dy 



For a given k , the quantity/ (y ) is clearly a single-valued continuous function 
of y since P ( x , y) and dt\ (y) / dy are positive and continuous. Moreover 
an increase in k continuously increases f (y) . Thus, we get a set of non- 
intersecting curves x = f (y) which, like the circles y = const., fill up the 
ring R . It is necessary to notice that x = corresponds to k = , and that 
x = 2t is the curve for k = 2x by definition of y ( y ) . We take k as deter- 
mined by the above method to be the coordinate £ of any point on the curve 
x=f{y). 

Thus we have coordinates ( £ , y ) for which the integral invariant is the 
invariantj J dl-dy . Taking (§ , 77) as the modified polar coordinates of a point 
in a new plane, we find that, expressed in these coordinates, T has all the 
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properties specified for T in the theorem. Thus we infer that there are at 
least two invariant points as before. 

A further generalization is that the curves C a and Cb may be allowed to be 
any simply closed curves, one within the other, bounding a ring R; we may 
again state a similar theorem , for we can make a preliminary transformation, 
for instance a conformal one, to take these curves into concentric circles, when 
of course the integral invariant merely changes form. Here it may be necessary 
to consider with care the nature of the integral invariant near the boundaries 
after the transformation of coordinates. 

Finally we may permit the function P ( x , y ) to vanish at some or all points 
of the curves C a and Cb ■ Under certain restrictions it is certain that in this 
limiting case there will be invariant points also. 

9. Poincare's Method. It is interesting to notice that Poincare uses only 
a single property consequent on the existence of an invariant integral, namely 
that no continuum on R can be transformed into part of itself by the transfor- 
mation T (loc. cit.. p. 377). It seems improbable that this condition is equiva- 
lent to the condition that there exists an invariant integral. The existence of 
such an invariant integral is a fact which enters more intimately into the proof 
which I have given above. I do not know whether the modification of 
Poincare's theorem which results when the condition that an integral invari- 
ant exists is replaced by this weaker condition is true. 
Harvabd University, 
October 20, 1912 



